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Abstract. A new generalized Statistical Complexity Measure (SCM) 

was proposed by Rosso ct al in 2010. It is a functional that captures the 
notions of order/disorder and of distance to an equilibrium distribution. 
The former is computed by a measure of entropy, while the latter depends 
on the definition of a stochastic divergence. When the scene is illuminated 
by coherent radiation, image data is corrupted by speckle noise, as is 
the case of ultrasound-B, sonar, laser and Synthetic Aperture Radar 
(SAR) sensors. In the amplitude and intensity formats, this noise is 
multiplicative and non-Gaussian requiring, thus, specialized techniques 
for image processing and understanding. One of the most successful 
family of models for describing these images is the Multiplicative Model 
which loads, among other probability distributions, to the CJ*^ law. This 
distribution has been validated in the literature as an expressive and 
tractable model, deserving the "universal" denomination for its ability 
to describe most types of targets. In order to compute the statistical 
complexity of a site in an image corrupted by speckle noise, we assume 
that the cquiUbrium distribution is that of fully developed speckle, namely 
the Gamma law in intensity format, which appears in areas with little or 
no texture. We use the Shannon entropy along with the Bellinger distance 
to measure the statistical complexity of intensity SAR images, and we 
show that it is an expressive feature capable of identifying many types of 
targets. 
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1 Introduction 



Synthetic Aperture Radar (SAR) is a prominent source of information for many 
Remote Sensing applications. The data these devices provides carries information 
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which is mostly absent in conventional sensors which operate in the optical 
spectrum or in its vicinity. SAR sensors are active, in the sense that they carry 
their own illumination source and, therefore, are able to operate any time. Since 
they operate in the microwaves region of the spectrum, they are mostly sensitive 
to the roughness and to the dielectric properties of the target. The price to pay for 
these advantages is that these images are corrupted by a signal-dependent noise, 
called speckle, which in the mostly used formats of SAR imagery is non-Gaussian 
and enters the signal in a non-additive manner. This noise makes both automatic 
and visual analysis a hard task, and defies the use of classical features. 

This paper presents a new feature for SAR image analysis called Generalized 
Statistical Complexity. It was originally proposed and assessed for one-dimensional 
signals, for which it was shown to be able to detect transition points between 
different regimes [TH]. This feature is the product of an entropy and a stochastic 
distance between the model which best describes the data and an equilibrium 
distribution |12|13! . 

The statistical nature of speckled data allows to propose a Gamma law as the 
equilibrium distribution, while the 5" model describes the observed data with 
accuracy. Both the entropy and the stochastic distance are derived within the 
framework of the so-called (/i, (f>) entropies and divergences, respectively, which 
stem from studies in Information Theory. 

We show that the Statistical Complexity of SAR data, using the Shannon 
entropy and the Hellinger distance, stems as a powerful new feature for the 
analysis of this kind of data. 



2 The Multiplicative Model 

The multiplicative model is one of the most successful frameworks for describing 
data corrupted by speckle noise. It can be traced back to the work by Goodman [9] , 
where stems from the image formation being, therefore, phenomenological. The 
multiplicative model for the intensity format states that the observation in every 
pixel is the outcome of a random variable Z : 12 — > M+ which is the product of 
two independent random variables: X : — >■ K_|_, the ground truth or backscatter, 
related to the intrinsic dielectric properties of the target, and Y : ^ the 
speckle noise, obeying a unitary mean Gamma law. The distribution of the return, 
Z = XY, is completely specified by the distributions X and Y obey. 

The univariate multiplicative model began as a single distribution for the 
amplitude format, namely the Rayleigh law [TU], was extended by Yueh et al. [23] 
to accommodate the K law and later improved further by Frery et al. [7] to the 
G distribution, that generalizes all the previous probabihty distributions. Gao [5] 
provides a complete and updated account of the distributions employed in the 
description of SAR data. 

For the intensity format which we deal with in this article, the multiplicative 
model reduces to, essentially, two important distributions, namely the Gamma 
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and the Q laws. The Gamma distribution is characterized by the density function 



fi^)-^-Y^^''~'^M-Lz/c}, (1) 

being c > the mean, z > and L > 1, denoted r{L, L/c). This is an adequate 
model for homogeneous regions as, for instance, pastures over flat relief. The 
law has density function 

■'^ ' 7"r(L)r(-a) (7 + Lz)i-"' 

where —a, 7,2; > 0, L > 1, denoted C/°(a,7, L). This distribution was proposed 
as a model for extremely heterogeneous areas [7], and Mejail et al. |15ll6j 
demonstrated it can be considered a universal model for speckled data. 

Data obeying the F law are referred to as "fully developed speckle" , meaning 
that there is no texture in the wavelength of the illumination (which is in the 
order of centimeters). The absolute value of the parameter a in Equation ([T]) 
is, on the contrary, a measure of the number of distinct objects of size of the 
order of the wavelength with which the scene is illuminated. As a — > —00, the 

distribution becomes the F law. 



3 Generalized Measure of Statistical Complexity 

The information content of a system is typically evaluated via a probability 
distribution function (PDF) describing the apportionment of some measurable or 
observable quantity (i.e. a time series S{t)). An information measure can primarily 
be viewed as a quantity that characterizes this given probability distribution P. 
The Shannon entropy is often used as a the "natural" one |2T]. Given a discrete 
probability distribution P = {pi : i = 1, • • • , M}, with M the degrees of freedom. 
Shannon's logarithmic information measure reads S[P] = — Pi l^(-Pi)- It 
can be regarded as a measure of the uncertainty associated to the physical 
process described by P. From now on we assume that the only restriction on 
the PDF representing the state of our system is X^jLiPj ~ 1 (micro-canonical 
representation). If S[P] = Smin = we are in position to predict with complete 
certainty which of the possible outcomes i, whose probabilities are given by pi, 
will actually take place. Our knowledge of the underlying process described by the 
probability distribution is then maximal. In contrast, our knowledge is minimal 
for a uniform distribution and the uncertainty is maximal, S[Pe] = Smax- 

It is known that an cntropic measure does not quantify the degree of structure 
or patterns present in a process |4] . Moreover, it was recently shown that measures 
of statistical or structural complexity are necessary for a better understanding 
of chaotic time series because they are able to capture their organizational 
properties [5] . This kind of information is not revealed by measures of randomness. 
The extremes perfect order (like a periodic sequence) and maximal randomness 
(fair coin toss) possess no complex structure and exhibit zero statistical complexity. 
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There is a wide range of possible degrees of physical structure these extremes 
that should be quantified by statistical complexity measures. Rosso and coworkers 
introduced an effective statistical complexity measure (SCM) that is able to 
detect essential details of the dynamics and differentiate different degrees of 
periodicity and chaos [T^]. This specific SCM, abbreviated as MPR, provides 
important additional information regarding the peculiarities of the underlying 
probability distribution, not already detected by the entropy. 

The statistical complexity measure is defined, following the seminal, intuitive 
notion advanced by Lopez-Ruiz et al. [13] . via the product 

C[P] ^ H[P] ■ D[P, Pref]. (3) 

The idea behind the Statistical Complexity is measuring at the same time the 
order/disorder of the system (H) and how far the system is from its equilibrium 
state (the so-called disequilibrium D) [14|17j . The first component can be obtained 
by means of an entropy, while the second requires computing a stochastic distance 
between the actual (observed) model and a reference one. Salicrii et al. [19120) 
provide a very convenient conceptual framework for both of these measures. 

Let fz{Z'; 6) be a probability density function with parameter vector which 
characterizes the distribution of the (possibly multivariate) random variable Z . 
The (/i, (/))-entropy relative to Z is defined by 

H';{e)^h(^jj{fz{z'-e))dz'), 

where either (p : [O, oo) — > M is concave and : M — > M is increasing, or cf) is 
convex and h is decreasing. The differential element dZ' sweeps the whole support 
A. In this work we only employ the Shannon entropy, for which h{y) — y and 
</)(a;) = —x\nx. 

Consider now the (possibly multivariate) random variables X and Y with 
densities fx{Z;9i) and /i'(Z;02), respectively, where 6i and O2 are parameter 
vectors. The densities are assumed to have the same support A.. The {h, (j))- 
divergence between fx and fy is defined by 

Diix.y)^,,{lj{M^)Mz^,o.,.z), (4) 

where h: {Q,oo) — >■ [0,oo) is a strictly increasing function with h{0) = and 
(j): (0,00) — [0,00) is a convex function such that 0(0/0) = and 0(f>{x/0) = 
lima;_j.oo 4>{x)/x. The differential element dZ sweeps the support. In the following 
we will only employ the Hellinger divergence which is also a distance, for which 
h{y) = y/2, < y < 2 and (l>{x) = (V^ - 1)^ 

The influence of the choice of a distance when computing statistical complex- 
ities is studied in Reference [TT] . Following Rosso et al. , we work with the 
Hellinger distance and we define the Statistical Complexity of coordinate (i, j) in 
an intensity SAR image as the product 



C{i,j)^H{i,j}-D{i,j), 



(5) 
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where H{i,j) is the Shannon entropy observed in (i, j) under the model, and 
D{i,j) is the observed Hellinger distance between the universal model (the Q° 
distribution) and the reference model of fully developed speckle (the F law). 

As previously noted, if an homogeneous area is being analyzed, the G'^ and 
r model can be arbitrarily close, and the distance between them tends to zero. 
The entropy of the model is closely related to the roughness of the target, as 
will be seen later, that is measured by a. 

Computing these observed quantities requires the estimation of the parameters 
which characterize the F distribution (c, the sample mean) and the G''^ law (a 
and 7), provided the number of looks L is known. The former is immediate, 
while estimating the later by maximum likelihood requires solving a nonlinear 
optimization problem. The estimation is done using data in a vicinity of (?,j). 
Once obtained c and (S,7), the terms in Equation ([5| are computed by numerical 
integration. References |1I6| discuss venues for estimating the parameters of the 
G'^ law safely. 



4 Results 



Fig ure [l] presents the main results obtained with the proposed measures. Fig- 
ure IT(aJ[ shows the original image which was obtained by the E-SAR sensor, 
an airborne experimental polarimetric SAR, over Munich, Germany. Only the 
intensity HH channel is employed in this study. The image was acquired with 
three nominal looks. The scene consists mostly of different types of crops (the 
dark areas), forest and urban areas (the bright targets). [2] Figure 1(b) shows 
the Shannon entropy as shades of gray whose brightness is proportional to the 
observed value. It is remarkable that this measure is closely related to the rough- 
ness of the target, i.e., the brighter the pixel the more heterogeneous the area. 
The entropy is also able to discriminate between different types of homogeneous 
targets, as shown in the various types of dark shades. 
Figure 



1 c 



shows the Hellinger distance between the universal model and the 
model for fully developed speckle. As expected, the darkest values are related to 
areas of low level of roughness, while the brightest spots are the linear strips in 
the uppermost right corner, since they are man-made structures. 

The Statistical Complexity is shown in Figure 1(d) It summarizes the evidence 
provided by the entropy (Figure 1(b) ) and by the stochastic distance between 
models (Figure 1(c) ). As it can be seen in the image, the values exhibit more 
variation than their constituents, allowing a fine discrimination of targets. As 
such, it stems as a new and relevant feature for SAR image analysis. 

The data were read, processed, analyzed and visualized using R v. 2.14.0 [22] 
on a MacBook Pro running Mac OS X v. 10.7.3. This platform is freely available 
at http://www.r-project.org for a diversity of computational platforms, and 
its excellent numerical properties have been attested in |2l3j . 
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5 Conclusions 

The Statistical Complexity of SAR images reveals information which is not 
available either through the mean (which is the parameter of the model for ho- 
mogeneous areas) or by the parameters of the model for extremely heterogeneous 
areas. As such, it appears as a promising feature for SAR image analysis. 

Ongoing studies include the derivation of analytical expressions for the entropy 
and the Hellinger distance, other stochastic distances, the sample properties of 
the Statistical Complexity and its generalization for other models including 
Polarimetric SAR. 
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(a) Original E-SAR image (b) Entropies 




(c) Hellinger distances (d) Statistical Complexities 

Fig. 1. Results of applying the proposed feature extraction to an E-SAR image over 
Munich, Germany 



